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Abstract. We establish a new class of relations among the multiple zeta values 
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c(fci,...,*i)= £ 



fci ^1 ' 

jii>--->w i >1 n l ' " n k 

which we call the cyclic sum identities. These identities have an elementary proof, 
and imply the "sum theorem" for multiple zeta values. They also have a succinct 
statement in terms of "cyclic derivations" as introduced by Rota, Sagan and Stein. 
In addition, we discuss the expression of other relations of multiple zeta values via 
the shuffle and "harmonic" products on the underlying vector space S) the noncom- 
mutative polynomial ring Q(x,y), and also using an action of the Hopf algebra of 
quasi-symmetric functions on Q(x,y). 



1. Multiple zeta values. Let fci, . . . , h be positive integers with k\ > 1. The 
multiple zeta value ((ki, fc 2 . . . , fc;) (of weight fci H — • + fc; and length I) associated 
with this sequence is the sum of the convergent Z-fold infinite series 



E 



i 



m>n 2 >->n,>l n l n 2 -" n l 

These quantities were introduced under the name "multiple harmonic series" in 
[10], and independently (with an opposite convention for the order of the sequence) 
in [21]. They have appeared in knot theory [15], quantum field theory [4], and in 
connection with mirror symmetry [12]. There are many relations among multiple 
zeta values (henceforth MZVs) , starting with the widely known and often rediscov- 
ered identity £(2, 1) = C(3)- An outstanding example is the "sum theorem", which 
says that the sum of all MZVs of fixed length and weight is independent of length, 
i.e. 

^2 C(ki,k 2 ,---,ki) = ((n). 

{(ki,...,ki)\ki-\ hfci=n,fci>l} 
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This was conjectured in [10] and proved independently by A. Granville [8] and 
D. Zagier. Many other identities have been conjectured and proved in the last 
decade [2,3,15,17,18], but surprising new ones continue to appear. In this paper we 
establish a new class of relations of MZVs, which can be stated as follows. 

Cyclic sum theorem. For any positive integers k\, k 2 , . . . , h with some ki > 2, 

i 

(!) ^2(( k j +l,k j+1 ,...,ki,k 1 ,...,k j - 1 ) = 

3 = 1 

kj-2 

Y ^(kj ~ q,k j+ i,...,h,ki,.. .,kj-!,q+ 1). 

{j\kj>2} q=0 

The name we have given this result will be clearer if we state it in an alternative 
form using the "duality" of MZVs. If s = [k\, . . . ,k{) is an admissible sequence of 
positive integers (i.e., k\ > 1) with sum n, its dual sequence r(s) = (ji, . . . ,j n -i) 
(having sum n and ji > 1) is defined as follows. Let £ be the map that takes a se- 
quence to its sequence of partial sums, and let 3 n be the set of strictly increasing se- 
quences of positive integers with last element at most n. Set t(s) = 'E~ 1 C n R n T,(s), 
where for (ai, . . . , at) £ 3„, 

Rn(ai, ■ ■ ■ ,ai) = (n+ 1 - a,i,n+ 1 - a;_i, . . . ,n+ 1 - ai) 

and C n (a\, . . . , a{) is the complement of {ai, . . . , a^} in {1, 2, . . . , n} arranged in 
increasing order. Then r is an involution on the set of admissible positive-integer 
sequences. Now call two sequences of positive integers cyclically equivalent if one is a 
cyclic permutation of the other, and let II(n, I) be the set of cyclic equivalence classes 
of (not necessarily admissible) positive- integer sequences of sum n and length For 
the juxtaposition sis 2 of two admissible sequences one has t(sis 2 ) — t(s 2 )t(si) 
(see [10], Proposition 3.1), so t(s') is a cyclic permutation of r(s) when s' is an 
(admissible) cyclic permutation of s, and thus any equivalence class [s] £ II(n, I) has 
a dual equivalence class [t(s)] G II(n, n — I). (For example, {(2, 3), (3, 2)} £ 11(5, 2) 
has dual equivalence class {(2, 1, 2), (1, 2, 2), (2, 2, 1)} £ 11(5,3).) The cyclic sum 
theorem can be restated as follows: for any admissible sequence s — (ki, ...,k{) 
with k\ H h ki = n, 

(2) Y C(Pi + l,P2,..-,w) = Y C(<?i + 1,92, • • • 

(pi,...,pi)e[s] (gi,...,g n _;)£[r(s)] 

(For the equivalence of the two forms, see the remarks following Theorem 2.3 below.) 

The cyclic sum theorem has an elementary proof involving partial fractions, but 
admits a remarkably simple expression in terms of "cyclic derivations" (in the sense 
of [20]) of the noncommutative polynomial algebra Q(x, y). In this way it parallels 
an earlier result (Theorem 5.1 of [10], reformulated as Theorem 2.1 below), which 
was proved by an uncnlightcning partial-fractions argument but can be expressed 
very simply in terms of ordinary derivations of Q(x,y). The cyclic sum theorem 
also implies the sum theorem, giving a new proof of this result which does not 
involve generating functions as used by Granville and Zagier. 

We introduce our algebraic machinery in §2: as in [11], we think of MZVs as 
images of monomials under a map C : — ► R, where fj° is an appropriate subspace 



RELATIONS OF MULTIPLE ZETA VALUES 



3 



of Q(x, y). We define derivations and cyclic derivations of Q(x, y) that give rise to 
identities of MZVs, and show how the cyclic sum theorem implies the sum theorem. 
In §3 we prove the cyclic sum theorem by elementary methods. In §4 we return 
to our algebraic viewpoint, recalling the shuffle and "harmonic" products on $)° 
(both of which make £ a homomorphism) , and relating them to Theorem 2.1. In 
§5 we introduce an action of the Hopf algebra QSym of quasi-symmetric functions 
on Q(x, y). We show how Theorem 2.1, the cyclic sum theorem, and the identities 
proved by the second author in [17] can be expressed in terms of this action. 

The first author conjectured the cyclic sum theorem in August 1999, and thanks 
Michael Bigotte for checking it by computer against tables of known relations [1] 
through weight 12. The second author proved the conjecture during his stay at 
the Max-Planck-Institut fur Mathcmatik in Bonn in early 2000, and he thanks 
Masanobu Kaneko and Don Zagier for useful discussions and the Institut for its 
hospitality. The second author is supported in part by the Research Fellowship of 
the Japan Society for the Promotion of Science for Young Scientists. 

2. A noncommutative polynomial algebra and its derivations. In this 
section we introduce an algebraic approach by thinking of MZVs as values of a 
homomorphism from a subspacc of the noncommutative polynomial algebra Q(x, y) 
to the reals. We then consider both derivations and "cyclic derivations" (defined 
below) of Q(x, y), and formulate relations of MZVs in terms of them. We state the 
cyclic sum theorem algebraically (Theorem 2.3), show it is equivalent to the two 
forms (1) and (2) given in §1, and finally prove the sum theorem from the cyclic 
sum theorem. 

Let Q(x,y) be the algebra of polynomials over the rationals in noncommutative 
indeterminates x, y, regarded as a graded Q-algebra with x and y both of degree 1. 
For any word (monomial) w of Q(x, y), denote by \w\ its total degree (also called 
its weight) and by £{w) the number of occurrences of y in w (called the length of 
w). We call \w\ — £(w) the colength of w: it is the number of occurrences of x in 
w. The underlying graded rational vector space of Q(x, y) is denoted $). 

Let f) 1 = Ql © fry and S}° — Ql © xfiy. Then io 1 is a subalgebra of Q(x, y), in 
fact the noncommutative polynomial algebra on generators Zi = x % ~ x y. We have 
fj = 9)x ffi f) 1 and f) 1 = yf) 1 © Sj . We can think of MZVs as images of words of 
Q(x,y) under the Q-lincar map ( : f)° — ► R defined by C(l) = 1 and 

C^ 1 " V fe2_1 y • • • = Cfa, fe. ■■■,*!) 

for any positive integers k\, ki, . . . , ki with k\ > 1. 

Let r be the anti-automorphism of Q(x, y) exchanging x and y, e.g. r(x 2 yxy) = 
xyxy 2 . Evidently t is an involution. Applied to words, r preserves weight and 
exchanges length and colength: note that $j° (but not Sj 1 ) is closed under r. It is 
easy to check that for dual sequences s = (k±, . . . , hi) and r(s) = (ji, . . . ,j n -l) as 
defined in the preceding section, r(x kl ~ 1 y ■ ■ ■ x kl ~ 1 y) = x^^y ■ ■ ■ x^ n - l ' 1 y. 

As usual, by a derivation of Q(x, y) we mean a map F : — > fj (of graded rational 
vector spaces) such that F(uv) — F(u)v + uF(v) for all u, v G f). The commutator 
of two derivations is a derivation, so the set of derivations of Q(x, y) is a Lie algebra 
graded by degree. If 5 is a derivation, then 5 = tSt is also a derivation (of the same 
degree). We call a derivation 5 symmetric if S = S and antisymmetric if S = —5. 
Since [5, e] = [5, e] , symmetric and antisymmetric derivations behave nicely under 
commutator, e.g. [5, e] is antisymmetric if S is symmetric and e antisymmetric. 
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Note that a symmetric or antisymmetric derivation is completely determined by 
where it sends x. 

We denote by D the derivation such that D(x) = and D(y) = xy. In terms of 
the generators Zi of 9) 1 mentioned above, we have D(zi) — z i+ i and more generally 

(3) D ( K Z^ 1 Zi 2 • • • z^ L ) — Zi x -i-i z^ 2 • -\- Zi x Zi 2 +\Zi 3 • • • Zj il -(- • • • -\- z^ 1 • • • z^ l _ 1 Zi l J r \. 

The following result was proved by a partial-fractions argument in [10]. We note 
that the hypothesis on w cannot be weakened, since ((D(y)) = ((xy) ^ = 
C(D(y))- 

Theorem 2.1. For any word w of ft , ((D(w)) = ((D(w)). 

K. Ihara and M. Kaneko [13] have generalized Theorem 2.1 as follows. For 
n > 1 let d n be the antisymmetric derivation with d n (x) — x(x + y) n ~ 1 y; note that 
d 1 =D-D. 

Theorem (Ihara and Kaneko). For alln> 1 and words w off) , ((d n (w)) = 0. 
We shall discuss the proof of this result in §5 below. 

We now consider cyclic derivations of Q(x,y); these are not derivations, but 
rather are defined as follows (cf. [20]). 

Definition. A cyclic derivation ip ofQ(x,y) is a Q-linear map ip : 9) — > Endf), 
where Endf) is the graded rational vector space of endomorphisms of (as a graded 
rational vector space), such that 

(4) (Wi/j),/) - Mfi), hf) + (V>(/ 2 ), fh) 

for all /i, fi, f £ Sj, where (j3, u) denotes the image of u £ under (3 £ End,?). 

If tp is a cyclic derivation, evidently is the zero endomorphism. By induction 
equation (4) is easily extended to the identity 

(5) (V(/l/2 • • • /„),/) = M/l), h ■ ■ ■ U) + W/ 2 ), h ■ ■ ■ fnffl) 

+ •■■ + Wn),//l ■■■fn-l) 

(cf. Proposition 2.3 of [20]; unfortunately the statement has a misprint). 

Just as the conjugate by r of an ordinary derivation is a derivation, it is possible 
to conjugate a cyclic derivation by r as follows. 

Proposition 2.2. Suppose ip is a cyclic derivation. Then the map $ : Sj — > Endij 
is also a cyclic derivation, where 

W/),5)=t(V(t(/)),t( 5 )) 

for f,g £ 9). 

Proof. It suffices to check identity (4), which is routine. □ 

If ip is a cyclic derivation and / £ 9}, the endomorphism gives rise to a 

canonical element (tp(f),l) of 9); we shall abuse notation and write ^p(f) for this 
element of f) when no confusion can arise. Note that as elements of f), ip(fg) = 
ip(gf) and $(f) = t-0(t(/)) for any f,g £f). 
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Now we define the cyclic derivation C of Q(x,y) by setting C(x) = (zero 
cndomorphism) and {C(y), f) — xfy for all / € S). By applying identity (5), it is 
easy to see that 

(C( Zi ), f) = (Cix^y),/) = (C(y)Jx^ 1 ) = xfx^y = xfz t 

for / € $j, and thus (abusing notation as indicated above) that C(zi) — z i+1 . Given 
an arbitrary monomial w — z^z^ ■ ■ ■ z i[ of Sj 1 , we can apply identity (5) to get 

(6) C(w) = z il+1 z i2 ■■■z il + z i2+1 z i3 ■ ■ ■ Zi, z il H h z il+ i Zil ■ ■ ■ z it _ x , 

which may be compared to equation (3). Similarly, it can be shown that 

C{Zi 1 Zi { ) = y ^ y ^ Zi j ^ q Zi j + 1 ■ ■ ■ Zi l Zi 1 ■ ■ ■ Zi j _ 1 Z q +\. 
ij>2 q=Q 

Thus, form (1) of cyclic sum theorem as stated in §1 may be expressed as follows; 
the proof is given in §3 below. 

Theorem 2.3. For any word w off) 1 that is not a power ofy, ((C(w)) = ((C(w)). 

Let s — (ki,...,ki) be an admissible sequence of positive integers, and w = 
x kl ~ 1 y ■ ■ ■x kl ~ 1 y the corresponding word of $)°. In view of equation (6), 

((C(w))=m(w) ^2 C(Pi + 1 ,P2,---,Pi), 
(P!,...,Pi)e[s] 

where [s] is the equivalence class of s in Il(fci + •■• + &{,/) and m(w) is the largest 
integer m such that w — u' m for u e Then form (2) of the cyclic sum theorem is 
equivalent to ((C(w)) = ((Ct(w)) (note m(r(w)) = m(w)), and this is equivalent 
to Theorem 2.3 since £ is r-invariant (see Theorem 4.1 below). 

Theorems 2.1 and 2.3 are of course formally very similar. Both give an equation 
between a sum of MZ Vs of length I and a sum of MZ Vs of length l + l. An important 
difference between the two results is that C is much simpler than D on periodic 
words of .f) . For example, Theorem 2.3 applied to z l n gives 

n-2 

((Zn+lZ 1 - 1 ) = ^C{z n - l Z l ~ 1 Z l+l ). 

8=0 

In the sequence notation, the case n = 3 is 

C(4,3,...,3) = C(3,3,...,3,1) + C(2,3,...,3,2), 

which does not seem to follow easily from other known identities. We close this 
section by deducing the sum theorem from the cyclic sum theorem. 
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Corollary 2.4. For any integers 1 < I < n, let S(n, I) be the sum of words w G $)° 
with \w\ = n and £(w) = I. Then J2weS(n I) C( w ) * s independent of I (and, in 
particular, is equal to ((z n ) = ((n)). 

Proof. Consider the element fi = (x + ty) n ~ 1 - a;" -1 - t n ~ 1 y n ~ 1 e From the 

properties of cyclic derivation (cf. Corollary 2.5 of [20]) we have 

C((x + tyY 1 - 1 ) = (C((x + ty)"- 1 ), 1) = (n - l)(C(x + ty), (x + ty) n - 2 ) 

= (n - WC(y), (x + ty) n - 2 ) = (n - l)tx(x + ty) n - 2 y, 

and thus C(p) = (n — l)(tx(x + ty) n ~ 2 y — t n ~ 1 xy n ~ 1 ): a similar calculation gives 
C(fi) = (n — l)(x(x + ty) n ~ 2 y — x 1l ~ 1 y). For each 1 < I < n — 1, the coefficient of 
t l in ((C(fi) — C(/z)) gives the identity 

E cm- E CH = o 

w£S(n,l) weS(n,l+l) 

after dividing by n — 1. □ 

3. Proof of the cyclic sum theorem. For positive integers fci, k 2 , • • • , ki and 

nonnegative integer fc;+i, let 



1 

„, .,m >„,.,>() ("1 •••"/ 



r(*i,---,*0 = E - - 



and 

5(fei,...,fci,Ai + i)= E 7 — fc, fc, + r 

iii>n2>->m>m+i>0 l n l n l+i) n i ■■■ n i n i+\ 

For the convergence of these series, we have the following. 

Theorem 3.1. T{k\,...,k{) is bounded when one o/fci,...,fc; exceeds 1, 
S(k\, . . . ,ki, k l+ i) is bounded when one of fci, . . . , fc;, fc; +1 + 1 exceeds 1. 

Our key result is as follows. 

Theorem 3.2. For any positive integers ki, ki . . . , ki with ki > 1 for some i, 

fei-2 

T{k 1: ki)-T(k 2 , ...,ki, fci) = Cfa + l. k 2 , ■ ■ ■ , h)- E C(h - J. *2. ■ ■ ■ , 

where the sum on the right is understood as if k\ = 1 . 

To prove the cyclic sum theorem in the form stated in §1, sum Theorem 3.2 over 
all cyclic permutations of the sequence (fci, . . . , ki). 
It is immediate that 

(7) S(ki, . . . , fci, 0) = T(fci, ...,ki) — C(fci + 1, fc 2 , • • • , h). 

Also, applying the identity 

(8) 



rai(m-nj + i) n i+1 \m - n l+1 m 
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to S(h,...,ki,ki +1 ) gives 

(9) 5(fci, ...,ki, fcj+i) = 5(fei - 1, fc 2 , • • • , fcj, fc ;+ i + 1) - ((ki, k t ,ki +1 + 1). 
Finally, applying (8) to 5(1, k 2 , . . . , k h k t+1 ) gives 

v i i 

» 1 >» a >^>» I+1 >o^---»N£. + i 1+1 Ui-m+i -i 
y I f 

ri2>--->n(>n ! + i>0 "2 ■ ■ ■ n l a l+l n 1 =n 2 +l V 1 + 1 



1 + 1 1 

v - V 

n 2 >---> ni >n l + 1 >a n 2 ■■■ n l n l+l j=Q 1 J 



,2>...Jk i+1>J ->o (na-jJnf-nT^ 1 
and so 

(10) 5(1, k 2 ,..., k u k l+1 ) = T{k 2 , ...,k t , k l+1 + 1). 

Proof of Theorem 3.2. Apply equation (7), then equation (9) k\ — 1 times, and 
finally equation (10): 

T{k u ...,h)- C(fci + 1, k 2 , . . . , h) = S(k u ...,k h 0) = 

S(fci-l,As2,...,fc I ,l)-C(fci,...,fc I ,l) = -.- = 

fei-2 

5(1, k 2 , . . . , ki, ki - 1) - ^ ((h -j,k 2 ,...,k t ,j + l) = 

3=0 

fei-2 

T(k 2 ,...,h,ki) - ^2 C( fc i ~j,k 2 ,...,ki,j + l). □ 

Proof of Theorem 3.1. Using equation (7), 

S(ki, . . .,ki,ki+i) < S(ki, ...,ki,0) < T(ki, . . .,h), 

so 5(fci, . . . , h + i) is bounded if T(k\, . . . ,k{) is; and if k\ = 1, equation (10) says 
5(fci, . . . , ki + i) — T(k 2 , . . . ,ki, ki + i + 1). So the statement about the 5's follows 
from the one about the T's. Also, to prove the first assertion it is evidently enough 
to treat the case ki H h ki =1 + 1. Now 

T(2,l,. ..,!)= J2 ' 



nj(m - ni +1 )n 2 ...m 



< E nr^ 

n 1 >n 2 > — >ni>0 1J 
ni>j>0 



l-l 



C(3, !,...,!) + iC(2, 1, . . • , 1) + C(2, 1, . . . , 1, 2, 1, . . . , 1), 



i=i 

i— l 
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so T(2, 1, . . . , 1) is bounded. Then by equations (7) and (10), we have 

T(l, 2, 1, . . . , 1) = 5(1, 2, 1, . . . , 1, 0) + C(2, 2, 1, . . . , 1) 
= T(2,1,...,1,1) + C(2,2,1,...,1) 

and we can continue in this way to bound all the sums T(l, . . . , 1, 2, 1, . . . , 1). □ 

4. Commutative multiplications on ft. There are two commutative multipli- 
cations on the vector space $j, both of which have significance for MZVs. First, 
there is the shuffle product m, which can be defined inductively on words of Sj by 
requiring that it distribute over addition and satisfy the axioms 

51. for any word w, lmw = wml = w; 

52. for any words w\,W2 and a, b G {x, y}, 

aw\mbw2 = a{w\mbw2) + bfawimwz). 

It is evident that (Sj°, ni) is a subalgebra of (Sj, in), and we have the following result. 

Theorem 4.1. £ is a T-invariant homomorphism of (Sj , ni) into R. 

Proof. This follows from the representation of MZVs as iterated integrals (see 
[21,14,11,9]). If we define iterated integrals recursively by 



and 




for ai = f(t)dt, then it is easy to show that for any nonnegative integers pi , P2 , • • ■ ,pi 
with pi > 1, 

Jo 

where ojq = dt/t and u>\ = dt/(l—t). That ( is a homomorphism then follows from 
the fact that iterated integrals multiply according to shuffle products [19]. The 
r-invariance follows from a change of variable. □ 

Second, there is the "harmonic" product * defined on S) by requiring that it 
distribute over addition and satisfy the axioms 
HI. for any word w, l*w = w*l=w; 
H2. for any word w and positive integer p, x p * w — wx p ; 
H3. for any words wi, W2 and positive integers p, q, 

x p ~ 1 ywi*x q ~ 1 yw2 = x p ~ 1 y(wi*x q ~ 1 yw2)+x q ~ 1 y(x p ~ 1 ywi*W2)+x p+q ~ 1 y(wi*W2)- 

As was shown in [11], this defines a commutative and associative product on Sj. 
If f) 1 is regarded as the underlying vector space of the noncommutative algebra 
Q(zi, Z2, ■ ■ ■ ), where Zi = x l ^ 1 y as in §2, then axiom (H3) for words of f) 1 reads 

Z p Wl * Z q W 2 = Z p (Wl * Z q W 2 ) + Z q (z p W! * W 2 ) + Z p+q {uil * W 2 ) , 

which may be compared to (S2). This can be thought of as describing multiplication 
of series, and the following result is proved in [11]. 
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Theorem 4.2. The map ( : (Sj°, *) — > R is a homomorphism. 

On the other hand, we can define a Q-lincar map from <j> : Sj 1 — > Q[[ii, t 2 , ■ ■ • ]], 
where Q[[f 1, £2, ■ • ■ ]] is the Q-algebra of formal series in the countable set of (com- 
muting) variables f 1, t 2 , ■ ■ . , by setting (f)(1) = 1 and 

rii>ri2>--->n; >1 

Then is a homomorphism, and in fact a monomorphism; its image is the algebra 
QSym of quasi-symmetric functions as defined in [7]. A formal power series (of 
bounded degree) in ti,t 2 , ■ ■ ■ is called a quasi-symmetric function if the coefficients 
and t p 1 t P2 ■ ■ ■ t p k and t Pl t P2 ■ ■ ■ t Pk are the same whenever ii < i 2 < ■ • ■ < ik and 
ji < 32 < • • • < 3k ■ Evidently any quasi-symmetric function is a sum of monomial 
quasi-symmetric functions, which can be defined as the elements 4>(zi x ■ ■ ■ Zi k ). 

Since £ : Sj° — > R is a homomorphism for both multiplications, any element of 
the form (u, v) — umv — u * v for u, v G Sj° must be in the kernel of £. Together 
with Theorem 2.1, our next result shows that this remains true for elements of the 
form (y, w), w G 

Theorem 4.3. For words w of $j, ymw — y *w — D(w) — D(w), where D is the 
derivation of §2. 

Proof. We proceed by induction on £(w). If £(w) = 0, then w = x k and we have 

ymx k -y*x k = yx k +xyx k ^ 1 -\ Yx k y-yx k = xyx^ 1 ^ Yx k y = D(x k )-D(x k ). 

Now suppose £(w) — n > and the result holds for words of length less than n. 
Then we can write w — x k yw\ for some word w\ with t(w\) = n — 1. Using the 
inductive definitions (S2) and (H3), we have 

ym[x k ywi) = yx k yw\ + xyx k ~ 1 ywi + • • • + x k y 2 w\ + x k y(ymwi) 
y * (x k yw 1 ) = yx k ywi + x k+1 yw 1 + x k y(y * wi), 

whose difference, assuming the induction hypothesis, is 

(11) xyx k ~ 1 ywi H h x k y 2 w 1 - x k+1 yw x + x k y(D - D)(w 1 ). 

On the other hand, since D — D is a derivation, we have 

(D - D)(x k ywi) = xyx k ~ 1 ywi H h x k y 2 w 1 - x k+1 ywx + x k y(D - D)(wi), 

which agrees with (11). □ 

5. Action of the Hopf algebra QSym on Sj. In this section we put a Hopf 
algebra structure on Jo 1 = QSym, and define an action of this Hopf algebra on 
Q(x, y) that is related to the results of the preceding sections. In particular, we 
state a previous result of the second author in terms of this action, and give a proof 
of the result of Ihara and Kaneko stated in §2. All the algebraic definitions needed 
can be found in [14]. 
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Clearly the algebra QSym of quasi-symmetric functions contains the algebra 
Sym of symmetric functions. In fact, the isomorphism : Sj 1 — > QSym of the 
preceding section takes z n to the power-sum symmetric function p n = J^i™, and 
Zy to the elementary symmetric function e n = ^ il< „. <i t il •■■*»„. Further, QSym 
can be given a Hopf algebra structure that extends the usual Hopf algebra structure 
on Sym (see [6,5]); the primitives are the power-sum symmetric functions p n . If 
A ri^ 1 — ^i^ 1 ®^ 1 is the adjoint of the concatenation product on the generators Zi, 

3=0 

then (Sj 1 , *, A) is a Hopf algebra and the map (f> : Sj 1 ^ QSym is an isomorphism 
of Hopf algebras. Henceforth we shall identify (Sj 1 , *, A) with QSym via <j)\ so z n is 
the nth power-sum symmetric function and so forth. 

Now define a Q-linear map • : Sj 1 ® Sj — > Sj as follows. Let 1 • w = w for any 
word w of Sj. For a nonempty word u of Sj 1 , let u ■ x = 0, 

f z*?/, u = z k , 
u ■ y = < 

I 0, otherwise, 

and 

(12) U ■ W\W2 = • Wi)(u" ■ W2) 

u 

for words W\, W2 of Sj, where A(u) = J2u u ' ® M "- Then the coassociativity of A 
implies that u • w G io is well-defined for any words u oi Sj 1 and «; of Sj. The 
following fact is immediate from the definitions. 

Proposition 5.1. For words w of Sj, Z\ ■ w = D{w); more generally, the linear 
map D n : Sj — > Sj defined by D n {w) — z n ■ w is the derivation sending x to and y 
to x n y. 

The map • is related to the multiplication * of the previous section as follows. 

Lemma 5.2. For words u e Sj 1 and w G Sj, u ■ w is the sum of terms in u * w of 
length £(w). 

Proof. We proceed by induction on £(w). If £(w) < 1 then w is cither a power of x 
or of the form x p yx q , and the conclusion is clear from the definition. Now suppose 
the conclusion is true if £(w) < n and let w be a word of length n. Writing u = ZiU\ 
and w — x p ~ 1 ywi, we have (from axiom (H3) above) 

u * w — Zi{u\ * w) + x p ~ 1 y(u * wi) + x t+p ~ 1 y(ui * Wi). 

Note that only the last two terms can contribute words of length £(w). Since 
£(wi) < n, we have by the induction hypothesis 

sum of terms of length £{w) in u * w — x p ~ 1 y(u ■ u>i) + x l+p ~ 1 y(ui ■ u>i). 

But applying equation (12) to u ■ w = ZiU\ ■ x p ~ 1 ywi gives 

u ■ w = (1 • x p ~ 1 y)(u ■ wi) + {zi ■ x p ~ 1 y)(ui ■ wi) 

= x p - 1 y(u-w 1 )+x l+p - 1 y(u 1 -w 1 ). □ 
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Theorem 5.3. The map ■ : Sj 1 ® Sj — > $j is an action of the algebra QSym = Sj 1 
on and in fact makes Q(x,y) a QSym-module algebra. 

Proof. It suffices to show that u ■ (v ■ w) — (u * v) ■ w for words u, v of io 1 and w 
of Sj. But by the lemma, both sides are just the sum of words of length £(w) in 
u * (v * w) — (u * v) * w. □ 

There is a relation between the action and the cyclic derivation C of §2. 

Proposition 5.4. For positive integers n,m, C{x n y m ) — z n ■ xy m . 

Proof. Using identity (5), we have 



C{x n y m ) = {C{x n y m ),l) = {C(y m ) 7 x n )=xy m - 1 x n y + xy m - 2 x n y 2 + --- + x n + 1 y m , 



and the conclusion follows. □ 

This has the following corollary. 
Corollary 5.5. For all n,m> 1, Q{z n ■ xy m ) — ((z m ■ xy n ). 

Proof. By the preceding result, z n -xy m = C(x n y m ); on the other hand, rC{x n y m ) = 
C(T(x n y m )) — C{x m y n ) — z m ■ xy n , and the conclusion follows from Theorems 2.3 
and 4.1. □ 

A result of the second author [17] can be formulated in terms of the QSym-action 
on Q(x,y) as follows. Let h n denote the complete symmetric function of degree n, 
i.e. the sum of all monomials in the Zi of weight n. 

Theorem 5.6. For all integers n>0 and words w of Sj° , ((h n -t(w)) = ((h n -w). 

In view of Proposition 5.1 (and the T-invariancc of £), Theorem 2.1 is the case 
n = 1 of this theorem. In fact, Ihara and Kaneko proved the theorem stated in §2 
by showing it equivalent to Theorem 5.6. The argument that follows is based on 
their proof, but has been recast in terms of the QSym-action. 

If we let be the ring of formal power series in t with coefficients in Sj, then 

the action of QSym on f) extends to an action of QSym[[t]] on $j[[t]]. Since p n = z n 
acts on f) as the derivation D n of Proposition 5.1, the operator 



is an automorphism of Sj[[t\] by the following result. 

Lemma 5.7. Suppose S = tS\ +t 2 S 2 H , where each Si is derivation of 9). Then 



(C(y m )J) = xy m - x fy + xy m ' 2 fy 2 + ■■■ + xfy 



rn 



for any / 6 ij, so 
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is an automorphism of $j[[t]]. 

Proof. First, note that 5 is a derivation of given u = uo + tu\ + t 2 U2 + ■ ■ ■ 

and v = «o + tv\ + t 2 v 2 + • • • in Sj[[t\], the coefficient of t n in 6(uv) is 

Sp(u q V r ) = ^2 (^p( U q) V r + U q 5 p (v r )); 
p-\-q-\-r=n p+q+r—n 

but this is also the coefficient of t n in S(u)v + uS(v). It then follows that exp(<5) is 
an automorphism of Sj[[t\], since for u,v £ f)[[t]] we have 



in 1 n / rl \ 

eMS)(uv) = y: -tW = E ^ E d m^ n ->) = 

n>0 ' ri>0 ' i=Q ^ ' 

S^u) 5 j (v 



E E ^^=exp W (.)exp W (,). □ 
Now iJ(t) = 1 + /lit + /i 2 i 2 + • • • G QSym[[f]], and from the well-known identity 

v 7 n— 1 

(see, e.g., [16]) it follows that a t (u) = H{t) ■ u for u G [[£]] . Setting a t = ra t T, we 
can restate Theorem 5.6 as saying that ((a t (w) — <r t (w)) = for any word w of 
or equivalently (since o- t {ff) C 

(13) a t a^ (u) -uekcrC for all u £ Sj°[[t]\. 

Also, since Hit)- 1 = E(-t), where E(t) = l + eit + e 2 t 2 H = l + yt + y 2 t 2 -\ , 

we have <7 t _1 (w) = E(—t) • u. 

Lemma 5.8. $ = o t a^ x is uniquely characterized (among automorphisms ofSj[[i\] 
that fixt) by the properties 

i. $(x) = x(l - ty)- 1 

ii. $(a; + y) = x + y. 

Proof. To characterize an automorphism $ of Sj[[i\] that fixes t, it is evidently 
enough to know where $ sends x and y; property (i) gives $(x), and then property 
(ii) gives <&(y) = x + y — &(x). To see that <f> = Oto^ 1 satisfies these properties, 
first note that 

a t a^\x) = a t (E(-t) ■ x) = a t (x) = ra t {y) = r{H{t) ■ y) = 
r(y + ih\ ■ y + t 2 h 2 •?/ + •••) = r(y + txy + t 2 x 2 y + • • • ) = x + txy + t 2 xy 2 + ■ ■ ■ , 

and then do a similar calculation to show that a t o-^ 1 {y) = y — txy(l — ty)~ x . □ 
Now consider the derivation 

^— ' n 

n=l 

of $j[{t]]\ by Lemma 5.7, exp(<9 t ) is an automorphism of f)[[t]]. To show that d n (w) G 
kcr( for all n > 1 and w G Sj° is equivalent to (13) (and thus to Theorem 5.6), it 
suffices to prove the following. 
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Theorem 5.9. exp(<9 4 ) = <JtO t ■ 

Proof. We use Lemma 5.8. Since the derivations d n all take z = x + y to 0, it is 
evident that exp(<9 t ) satisfies property (ii). To show exp(dt)(x) = x(l — iy) _1 , set 

G(s) = exp(sd t )(x) = f>f(*)^ € *>[[«,*]]■ 

n=0 

Then G(s) is the solution of the initial- value problem G'(s) — d t G(s), G(0) = x. 
We claim that 

(14) CW^-i^lVf 

since the right-hand side also satisfies these conditions; the conclusion then follows 
upon setting s = 1. To verify the claim, let U = (1 — (1 — tz) s )/z and V — 
log(l — tz)/z; then the right-hand side of equation (14) is 

x{\ - Uyy 1 = x(l + Uy + UyUy +■■■) 

and the claim follows from the identities U'(s) = UzV — V, dtU — 0, dt(x) = —xVy, 
and dt(y) — zVy — yVy. □ 
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